AITopics | Marathon County

OCRBench v2: An Improved Benchmark for Evaluating Large Multimodal Models on Visual Text Localization and Reasoning

Fu, Ling, Yang, Biao, Kuang, Zhebin, Song, Jiajun, Li, Yuzhe, Zhu, Linghao, Luo, Qidi, Wang, Xinyu, Lu, Hao, Huang, Mingxin, Li, Zhang, Tang, Guozhi, Shan, Bin, Lin, Chunhui, Liu, Qi, Wu, Binghong, Feng, Hao, Liu, Hao, Huang, Can, Tang, Jingqun, Chen, Wei, Jin, Lianwen, Liu, Yuliang, Bai, Xiang

arXiv.org Artificial Intelligence, Dec-31-2024

Interest in evaluating the Optical Character Recognition (OCR) capabilities of Large Multimodal Models (LMMs) has grown recently. Existing benchmarks have highlighted the impressive performance of LMMs in text recognition; however, their abilities on certain challenging tasks, such as text localization, handwritten content extraction, and logical reasoning, remain underexplored. To bridge this gap, we introduce OCRBench v2, a large-scale bilingual text-centric benchmark with what is currently the most comprehensive set of tasks (4x more tasks than the previous multi-scene benchmark OCRBench), the widest coverage of scenarios (31 diverse scenarios, including street scenes, receipts, formulas, and diagrams), and thorough evaluation metrics, with a total of 10,000 human-verified question-answering pairs and a high proportion of difficult samples. After carefully benchmarking state-of-the-art LMMs on OCRBench v2, we find that 20 out of 22 LMMs score below 50 out of 100 and suffer from five types of limitations: recognition of less frequently encountered text, fine-grained perception, layout perception, complex element parsing, and logical reasoning. The benchmark and evaluation scripts are available at https://github.com/Yuliang-liu/MultimodalOCR.

large language model, machine learning, pattern recognition, (20 more...)

arXiv.org Artificial Intelligence

2501.00321

Country:

South America (0.04)
Africa (0.04)
North America > United States > Wisconsin > Marathon County > Wausau (0.04)

Genre: Research Report (1.00)

Industry:

Education (0.67)
Banking & Finance (0.45)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.91)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.68)

Immunis Appoints Neil Sahota as Chief Artificial Intelligence Officer

#artificialintelligence, Apr-17-2023, 10:42:49 GMT

Immunis, a private biotechnology company developing a novel treatment for age- and disease-related immune decline, has appointed Neil Sahota as its Chief Artificial Intelligence (AI) Officer. AI is a tool that is transforming the way businesses and scientists integrate information, conduct data analysis, and make informed decisions on how to optimize growth. For over 20 years, Neil has inspired AI modernization through technology-based business strategies and has been successful in helping businesses become leaders in the digital future. Immunis is confident that Neil will guide the company to unlock the considerable potential of AI in biotech.

aithority interview, chief artificial intelligence officer, immunis appoint neil sahota, (3 more...)

#artificialintelligence

Country:

North America > United States > Wisconsin > Marathon County (0.07)
Asia > China > Guangxi Province > Nanning (0.07)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.40)
